RAW SEQUENCE LISTING DATE: 11/14/2001 

PATENT APPLICATION: US/09/74 3,823 TIME: 14:15:45 



Input Set : A:\6184844j.app 
Output Set: N:\CRF3\1114200l\I743823, raw 

3 <110> APPLICANT: FOGHER, CORRADO 

5 <120> TITLE OF INVENTION: A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN 

6 LACTOFERRIN, VECTORS / CELLS AND TRANSGENIC PLANTS 

7 CONTAINING IT 

9 <130> FILE REFERENCE: 618484 - 4/JP/B- 4075PCT 

11 <140> CURRENT APPLICATION NUMBER: 09/743,823 

12 <141> CURRENT FILING DATE: 2001-01-16 

14 <150> PRIOR APPLICATION NUMBER: IT RM98A000478 

15 <151> PRIOR FILING DATE: 1998-07-17 
17 <160> NUMBER OF SEQ ID NOS : 25 
19 <170> SOFTWARE: PatentIn Ver. 2.1 

21 <210> SEQ ID NO: 1 ^ 

22 <211> LENGTH: 2079 

23 <212> TYPE: DNA 

24 <213> ORGANISM: Artificial Sequence 

26 <220> FEATURE: 

27 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 
2 8 encoding human lactoferrin 

30 <2 20> FEATURE: 

CDS 

(1) . . (2076) 




31 <221> NAME/KEY 

32 <222> LOCATION 



34 <4 00> SEQUENCE 



35 


ggc 


cgt 


agg 


aga 


agg 


agt 


gtt 


caa 


tgg 


tgc 


gca 


gta 


tea 


caa 


cca 


gag 


48 


36 


Gly 


Arg 


Arg 


Arg 


Arg 


Ser 


Val 


Gin 


Trp 


Cys 


Ala 


Val 


Ser 


Gin 


Pro 


Glu 




37 


1 








5 










10 










15 






39 


gcc 


aca 


aaa 


tgc 


ttc 


caa 


tgg 


caa 


agg 


aat 


atg 


aga 


aaa 


gtt 


cgt 


gga 


96 


40 


Ala 


Thr 


Lys 


Cys 


Phe 


Gin 


Trp 


Gin 


Arg 


Asn 


Met 


Arg 


Lys 


Val 


Arg 


Gly 




41 








20 










25 










30 








43 


cct 


cct 


gta 


tct 


tgc 


ata 


aag 


aga 


gat 


tea 


ccc 


ate 


cag 


tgt 


ate 


cag 


144 


44 


Pro 


Pro 


Val 


Ser 


Cys 


He 


Lys 


Arg 


Asp 


Ser 


Pro 


He 


Gin 


Cys 


He 


Gin 




45 






35 










40 










45 










47 


gca 


att 


gcg 


gaa 


aac 


aga 


get 


gat 


get 


gtg 


act 


ctt 


gat 


ggt 


ggt 


ttc 


192 


48 


Ala 


He 


Ala 


Glu 


Asn 


Arg 


Ala 


Asp 


Ala 


Val 


Thr 


Leu 


Asp 


Gly 


Gly 


Phe 




49 




50 










55 










60 












51 


ata 


tac 


gag 


gca 


gga 


ctt 


gcc 


cca 


tac 


aaa 


ctg 


cga 


cct 


gta 


gcg 


gcg 


240 


52 


lie 


Tyr 


Glu 


Ala 


Gly 


Leu 


Ala 


Pro 


Tyr 


Lys 


Leu 


Arg 


Pro 


Val 


Ala 


Ala 




53 


65 










70 










75 










80 




55 


gaa 


gtc 


tac 


ggg 


acc 


gaa 


aga 


caa 


cca 


cga 


act 


cac 


tat 


tat 


get 


gtg 


288 


56 


Glu 


Val 


Tyr 


Gly 


Thr 


Glu 


Arg 


Gin 


Pro 


Arg 


Thr 


His 


Tyr 


Tyr 


Ala 


Val 




57 










85 










90 










95 






59 


get 


gtt 


gtg 


aag 


aag 


ggc 


gga 


tct 


ttt 


cag 


ctg 


aac 


gaa 


ctt 


caa 


ggt 


336 


60 


Ala 


Val 


Val 


Lys 


Lys 


Gly 


Gly 


Ser 


Phe 


Gin 


Leu 


Asn 


Glu 


Leu 


Gin 


Gly 




61 








100 










105 










110 








63 


ctg 


aag 


tea 


tgc 


cac 


aca 


gga 


ctt 


cgc 


agg 


acc 


get 


gga 


tgg 


aat 


gtc 


384 


64 


Leu 


Lys 


Ser 


Cys 


His 


Thr 


Gly 


Leu 


Arg 


Arg 


Thr 


Ala 


Gly 


Trp 


Asn 


Val 




65 






115 










120 










125 










67 


cct 


ata 


ggg 


aca 


ctt 


cgt 


cca 


ttc 


ttg 


aat 


tgg 


acg 


ggt 


cca 


cct 


gag 


432 
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Input Set : A:\6184844j.app 
Output Set: N:\CRF3\1114200I\I743823.raw 

68 Pro He Gly Thr Leu Arg Pro Phe Leu Asn Trp Thr Gly Pro Pro Glu 

69 130 135 140 

71 ccc att gag gca get gtg gca aga ttc ttc tea gee tct tgt gtt cca 480 

72 Pro He Glu Ala Ala Val Ala Arg Phe Phe Ser Ala Ser Cys Val Pro 

73 145 150 155 160 

75 ggt gca gat aaa gga caa ttc ccc aac ctt tgt cgc ctg tgt gcg ggg 528 

76 Gly Ala Asp Lys Gly Gin Phe Pro Asn Leu Cys Arg Leu Cys Ala Gly 

77 165 170 175 

79 aca ggg gaa aac aaa tgt gca ttc tea tec cag gaa ccg tac ttc age 576 

80 Thr Gly Glu Asn Lys Cys Ala Phe Ser Ser Gin Glu Pro Tyr Phe Ser 

81 180 185 190 

83 tac tct ggt gcc ttt aag tgt ctt aga gac ggt get gga gat gtt get 

84 Tyr Ser Gly Ala Phe Lys Cys Leu Arg Asp Gly Ala Gly Asp Val Ala 

85 195 200 205 

87 ttt att aga gag age aca gtg ttt gag gat ctt tea gac gag get gaa 672 

88 Phe He' Arg Glu Ser Thr Val Phe Glu Asp Leu Ser Asp Glu Ala Glu 

89 210 215 220 

91 agg gac gag tat gag tta etc tgc cca gac aac act cgt aag cca gtt 720 

92 Arg Asp Glu Tyr Glu Leu Leu Cys Pro Asp Asn Thr Arg Lys Pro Val 

93 225 230 235 240 

95 gac aag ttc aaa gat tgc cat ctt gca egg gtc cct tct cat gcc gtt 768 

96 Asp Lys Phe Lys Asp Cys His Leu Ala Arg Val Pro Ser His Ala Val 

97 245 250 255 

99 gtg gca cga agt gtt aat gga aag gag gat gcc ate tgg aat ctt etc 816 

100 Val Ala Arg Ser Val Asn Gly Lys Glu Asp Ala He Trp Asn Leu Leu 

101 260 265 270 

103 cgc caa gca cag gaa aag ttt gga aag gac aag tea ccg aaa ttc cag 864 

104 Arg Gin Ala Gin Glu Lys Phe Gly Lys Asp Lys Ser Pro Lys Phe Gin 

105 275 280 285 

107 etc ttt ggt tec cct agt ggg cag aaa gat ctt ctg ttc aag gac tct 912 

108 Leu Phe Gly Ser Pro Ser Gly Gin Lys Asp Leu Leu Phe Lys Asp Ser 

109 290 295 300 

111 gcc att ggg ttt teg aga gtg cca cct agg ata gat tct ggg ttg tac 960 

112 Ala He Gly Phe Ser Arg Val Pro Pro Arg He Asp Ser Gly Leu Tyr 

113 305 310 315 320 

115 ctt ggc tec gga tac ttt act gca att cag aac ttg agg aaa agt gag 1008 

116 Leu Gly Ser Gly Tyr Phe Thr Ala He Gin Asn Leu Arg Lys Ser Glu 

117 325 330 335 

119 gag gaa gtt get gcc egg cgt gcg egg gtc gtt tgg tgt gcg gtg gga 1056 

120 Glu Glu Val Ala Ala Arg Arg Ala Arg Val Val Trp Cys Ala Val Gly 

121 340 345 350 

123 gag caa gag ttg cgc aag tgt aac cag tgg agt ggt ttg age gaa gga 1104 

124 Glu Gin Glu Leu Arg Lys Cys Asn Gin Trp Ser Gly Leu Ser Glu Gly 

125 355 360 365 

127 tct gtg acc tgc tea teg gcc tec act aca gaa gat tgc ate gcc ctg 1152 

128 Ser Val Thr Cys Ser Ser Ala Ser Thr Thr Glu Asp Cys He Ala Leu 

129 370 375 380 

131 gtg ttg aaa gga gaa get gat gee atg agt ttg gat gga gga tat gtt 1200 

132 Val Leu Lys Gly Glu Ala Asp Ala Met Ser Leu Asp Gly Gly Tyr Val 
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gtg 
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get 


grg 
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4- (T 
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a y a 


u-oa 


y ao 


7^ nt 

a. ^ L- 


a fTP 
ay o 


1344 
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144 


Val 


Glu 


Gly 


Tyr 


jjeu 


Ala 


V ax 


Ala 

i\Xa 


V ai 


V aX 


A ITT 
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D tJX 




X liX 


O t-X 
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A AH 
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A 4 S 
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Cut 
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tgg 
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C.CL. 
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1488 

X *T CJ tJ 
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Thr 


Gly 


Ser 


Lys 


Lys 


T-> V-\ ^ 

Pne 


ASp 


r* 1 1 1 
VjXU 


Tyr 


T> 

Fne 


O Q -r^ 
OCX 


oxn 


oex 


oy o 


Ala 
AXa 


X o 




157 










4 O -> 










4 y u 










A Q S 






ICQ 

loy 


ggt 


tC L 


gac 


cca 


aga 


-f- /li- 
ce L. 


aa u 


jT"! +- /-^ 




y o c 


c cy 


L.y c 


aft 
a u o 


y y" 


y d o 


fta rr 


1536 
J- ~f ^ \j 


IbU 


Gly 


*^ 

ser 


Asp 


pro 


Arg 


C' 

ber 


TV 

Asn 


xeu 


cys 


AX a 


T 1 

XiciU 


oy a 


X Xti 


X y 


A cr^ 

AC3^ 


m n 

v3 X U. 




161 








C A 

500 










R A C\ 
DUD 










Sin 

J X u 








163 


caa 




gag 


aa L, 


aag 


L.gc 


gtt 


ccc 


aaC 


age 


aao 


gag 


ay a 


"h o 
cao 


■f" P 

oao 


yy ^ 


1 534 

X O 'x 


164 


Gin 


Gly 


Glu 


Asn 


Lys 


cys 


vax 


Fro 


TV 


oex 


A T% 


1 n 

oX U, 


AX y 


lyr 


lyx 


vjxy 




165 






515 










C O A 










D ZD 










167 


tac 


act 




get 


4~ 4- /-t 

t uC 


•A*- 

cgr 


4— jOf yn( 

tgc 


L. ug 


gc L. 


gag 


^ ^ 4" 

aa C 


/-f >o 4~ 

get 


gga 


y ao 


y c c 


y oa 


1 S 2 


168 


Tyr 


Thr 


Gly 


Ala 


Pne 


Arg 


Cys 


Leu 


Axa 


GXU 


TV v\ 

Asn 


Ala 

Axa 


\jxy 


ASp 


V aX 


Axa 




169 




c n t\ 

530 










c 










4 n 

D4 u 












171 


ttt 


gtg 


aaa 


gat 


gtc 


act- 


g L.C 


ung 


cag 




a ^4" 

ao C 


ga c 


gga 


aa c 


aao 


^1 f 
aao 


X U O V/ 


172 


Pne 


Val 


Lys 


Asp 


val 


Tnr 


vax 


T y^ * 1 

Leu 




TV rr^ 

Asn 


V* T» 

1 nr 


ASp 


LrXy 


A 

Asn 


A c* n 


A G T» 




173 


545 










CCA 










^ 

D D D 










J D U 




175 


gag 


gca 


tgg 


get 


aag 


gat 


ttg 


aag 


/->4- 4- 


gca 


y ao 


ccc 


rr ex 

gcg 


ttct 
c eg 


<"» +- /-f 

ccg 


L.y o 


17 28 
X / ^ u 


176 


Glu 


Ala 


Trp 


Ala 


Lys 


Asp 


Leu 


Xtys 


Xteu 


Ala 

Axa 


ASp 


Fxie 


AXa 


T Qi 1 

XfC^li 


T oil 


oy o 




-1 r-j n 

177 










c ^ c 

56 5 










C T A 
D / U 










7 S 
D / D 






179 


etc 


gat 


ggc 


aaa 


cgt 


aag 


cct 


gtg 


act 


gaa 


+" 

get 


aga 


age 


xgc 


ca c 


O C C 


1 7 7 
X / / D 


180 


Leu 


Asp 


Gly 


Lys 


Arg 


Lys 


Pro 


Vax 


m \^ 

Tiir 


GXU 


Axa 


TV j-tt 

Arg 


O -v^ 

ber 


cys 


nXS 


xeu 




181 








M y\ 

580 










r* 

585 










C O A 

590 








183 


gcc 


atg 


gcc 


ccg 


aat 


cat 


get 


gtg 


gtg 


tct 


cgt 


atg 


gat 


aag 


gtg 


gaa 


1824 


184 


Ala 


Met 


Ala 


Pro 


Asn 


His 


Ala 


Val 


Val 


Ser 


Arg 


Met 


Asp 


Lys 


Val 


Glu 




185 






595 










600 










605 










187 


cgc 


ttg 


aaa 


cag 


gtg 


ttg 


etc 


eac 


caa 


cag 


get 


aaa 


ttt 


ggt 


aga 


aat 


1872 


188 


Arg 


Leu 


Lys 


Gin 


Val 


Leu 


Leu 


His 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Arg 


Asn 




189 




610 










615 










620 












191 


gga 


tct 


gac 


tgc 


ccg 


gac 


aag 


ttt 


tgc 


tta 


ttc 


cag 


tct 


gaa 


acc 


aaa 


1920 


192 


Gly 


Ser 


Asp 


Cys 


Pro 


Asp 


Lys 


Phe 


Cys 


Leu 


Phe 


Gin 


Ser 


Glu 


Thr 


Lys 




193 


625 










630 










635 










640 




195 


aac 


ctt 


ttg 


ttc 


aat 


gac 


aac 


act 


gag 


tgt 


ctt 


gcc 


aga 


etc 


cat 


ggc 


1968 


196 


Asa 


Leu 


Leu 


Phe 


Asn 


Asp 


Asn 


Thr 


Glu 


Cys 


Leu 


Ala 


Arg 


Leu 


His 


Gly 




197 










645 










650 










655 
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Input Set : A:\6184844j.app 

Output Set: N:\CRF3\11142001\I743B23, raw 

199 aaa aca aca tat gaa aaa tat ttg gga cca cag tat gtc gca ggc att 2016 

200 Lys Thr Thr Tyr Glu Lys Tyr Leu Gly Pro Gin Tyr Val Ala Gly lie 

201 660 665 670 

203 act aat ctg aaa aag tgc tea acc tec cca etc eta gaa gee tgt gaa 2064 

204 Thr Asn Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu Ala Cys Glu 

205 675 680 685 

207 ttc eta agg aag taa 2079 

208 Phe Leu Arg Lys 

209 690 

212 <210> SEQ ID NO: 2 

213 <211> LENGTH: 30 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial Sequence 

217 <220> FEATURE: 

218 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

220 <400> SEQUENCE: 2 

221 ggatccatgg gccgtaggag aaggagtgtt 30 

224 <210> SEQ ID NO: 3 

225 <211> LENGTH: 32 

226 <212> TYPE: DNA 

227 <213> ORGANISM: Artificial Sequence 

229 <220> FEATURE: 

230 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

232 <400> SEQUENCE: 3 

233 gagctcettc ggttttactt cetgaggaat tc 32 

236 <210> SEQ ID NO: 4 

237 <211> LENGTH: 42 

238 <212> TYPE: DNA 

239 <213> ORGANISM: Artificial Sequence 

241 <220> FEATURE: 

242 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 
244 <400> SEQUENCE: 4 

24 5 tctagataaa ataatctata cattaaaaaa tttgatttta aa 42 
248 <210> SEQ ID NO : 5 
24 9 <211> LENGTH: 3 6 

250 <212> TYPE: DNA 

251 <213> ORGANISM: Artificial Sequence 

253 <220> FEATURE: 

254 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

256 <400> SEQUENCE: 5 

257 ggatccgact gagteggata agaagaaaag aaaaga 36 

260 <210> SEQ ID NO: 6 

261 <211> LENGTH: 36 

262 <212> TYPE: DNA 

263 <213> ORGANISM: Artificial Sequence 

265 <220> FEATURE: 

266 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

268 <400> SEQUENCE: 6 

269 tctagagttt tcaaatttga attttaatgt gtgttg 36 
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272 <210> SEQ ID NO : 7 

273 <211> LENGTH: 3 6 

274 <212> TYPE: DNA 

275 <213> ORGANISM: Artificial Sequence 

277 <220> FEATURE: 

278 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

280 <400> SEQUENCE: 7 

281 ggatcccacc ttaaggaggt tgcaacgagc gtggca 36 

284 <210> SEQ ID NO: 8 

285 <211> LENGTH: 250 

286 <212> TYPE: DNA 

287 <213> ORGANISM: Artificial Sequence 

289 <220> FEATURE: 

290 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 
292 <400> SEQUENCE: 8 

2 93 ggccgtagga gaaggagtgt tcaatggtgc gcagtatcac aaccagaggc cacaaaatgc 60 

294 ttccaatggc aaaggaatat gagaaaagtt cgtggacctc ctgtatcttg cataaagaga 120 

295 gattcaccca tccagtgtat ccaggcaatt gcggaaaaca gagctgatgc tgtgactctt 180 

296 gatggtggtt tcatatacga ggcaggactt gccccataca aactgcgacc tgtagcggcg 240 

297 gaagtctacg 250 

300 <210> SEQ ID NO: 9 

301 <211> LENGTH: 250 

302 <212> TYPE: DNA 

303 <213> ORGANISM: Artificial Sequence 

305 <220> FEATURE: 

306 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 

308 <400> SEQUENCE: 9 

309 gcacctggaa cacaagaggc tgagaagaat cttgccacag ctgcctcaat gggctcaggt 60 

310 ggacccgtcc aattcaagaa tggacgaagt gtccctatag ggacattcca tccagcggtc 120 

311 ctgcgaagtc ctgtgtggca tgacttcaga ccttgaagtt cgttcagctg aaaagatccg 180 

312 cccttcttca caacagccac agcataatag tgagttcgtg gttgtctttc ggtcccgtag 240 

313 acttccgccg 250 

316 <210> SEQ ID NO: 10 

317 <211> LENGTH: 250 

318 <212> TYPE: DNA 

319 <213> ORGANISM: Artificial Sequence 

321 <220> FEATURE: 

322 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic DNA 
324 <400> SEQUENCE: 10 

32 5 aactggctta cgagtgttgt ctgggcagag taactcatac tcgtcccttt cagcctcgtc 60 

326 tgaaagatcc tcaaacactg tgctctctct aataaaagca acatctccag caccgtctct 120 

327 aagacactta aaggcaccag agtagctgaa gtacggttcc tgggatgaga atgcacattt 180 

328 gttttcccct gtccccgcac acaggcgaca aaggttgggg aattgtcctt tatctgcacc 240 

329 tggaacacaa 250 

332 <210> SEQ ID NO: 11 

333 <211> LENGTH: 255 

334 <212> TYPE: DNA 



335 
337 



<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 
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